Modeling dependency between regression classes in MLLR using multiscale autoregressive models
نویسندگان
چکیده
Adapting acoustic models to a new environment is usually realized by considering model transformations that are estimated on the adaptation corpus. Since such a corpus usually contains very few data, the models' Gaussians are most often partitioned into a few regression classes, and all the Gaussians in the same class share the same transformation. It is further possible to increase the number of transformations by modeling the dependency between the regression classes. We present, in this paper, such a technique where dependency is modeled by multiscale autoregressive (MAR) processes. The power of the MAR framework resides in its ability to efficiently and optimally estimate the state vector at each node of the regression tree, based on sparse and noisy measurements at different resolutions. The method is evaluated on a french numbers recognition task where the test corpus has been recorded in a car at various speeds and noise levels. The proposed adaptation method is based on Maximum Likelihood Linear Regression.
منابع مشابه
A novel target-driven MLLR adaptation algorithm with multi-layer structure
This paper presents a novel target-driven MLLR adaptation algorithm with multiply layer structure, which is based on the thorough analysis of MLLR using the generation of regression class trees. The new algorithm is constructed on the targetdriven principal. It generates the regression class dynamically, basing on the outcome of the former MLLR transformation. The regression classes is defined ...
متن کاملContext dependent tree based transforms for phonetic speech recognition
This paper presents a novel method for modeling phonetic context using linear context transforms. Initial investigations have shown the feasibility of synthesising context dependent models from context independent models through weighted interpolation of the peripheral states of a given hidden markov model with its adjacent model. This idea can be further extended, to maximum likelihood estimat...
متن کاملRapid speaker adaptation using MLLR and subspace regression classes
In recent years, various adaptation techniques for hidden Markov modeling with mixture Gaussians have been proposed, most notably MAP estimation and MLLR transformation. When the amount of adaptation data is limited, adaptation can be done by grouping similar Gaussians together to form regression classes and then transforming the Gaussians in groups. The grouping of Gaussians is often determine...
متن کاملMulti-layer structure MLLR adaptation algorithm with subspace regression classes and tying
MLLR is a parameter transformation technique for both speaker and environment adaptation. When the amount of adaptation data is scarce, it is necessary to do adaptation with regression classes. In this paper, we present a rapid MLLR adaptation algorithm, which is called Multi-layer structure MLLR adaptation with subspace regression classes and tying (SRCMLR). The method groups the Gaussians on ...
متن کاملModified Maximum Likelihood Estimation in First-Order Autoregressive Moving Average Models with some Non-Normal Residuals
When modeling time series data using autoregressive-moving average processes, it is a common practice to presume that the residuals are normally distributed. However, sometimes we encounter non-normal residuals and asymmetry of data marginal distribution. Despite widespread use of pure autoregressive processes for modeling non-normal time series, the autoregressive-moving average models have le...
متن کامل